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A Sys-fcem For Browsing The World Wide Web 
With A TraditionsJ. Telephone 



Background of the Invention 

Field of the Invention 

The present invention relates to interfacing with 
the world wide web and more particularly to browsing the 
world wide web using a telephone. 

15 Description of the Prior Art 

The World Wide Web (WWW) is rapidly becoming the 
single most important source of information for 
businesses and consumers. As individuals rely 

20 increasingly on the information available on the WWW, 

they will require ubiquitous access to this information. 
One device that is readily available in almost any 
environment is the telephone; thus, it seems natural to 
consider the telephone as a WWW access device. 

25 Traditional WWW browsers, such as Netscape's 

Navigator and Microsoft's Internet Explorer, offer 
complex visual renditions of WWW documents. These are 
not suitable for telephones because telephones lack 
sophisticated visual display mechanisms. Some telephones 

30 have small visual displays and some vendors offer WWW 
browsers targeted to these limited displays. The best 
example of such a browser is Unwired Planet's UP. Browser. 
These systems are still dependent on a visual display, 
however, and thus are not usable on traditional 

35 telephones which have no visual display. 
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The Web-On-Call system from Netphonic allows 
telephonic access to the WWW by providing an audio 
rendering of WWW documents. However, this system 
requires the WWW content-provider to modify, or pre- 
5 process, all of the content which they wish to make 
available. Such a system is called a server-side 
solution and does not enable individual WWW users to 
browse to arbitrary WWW sites. Users may only browse 
sites that have the special modifications. In order to 

10 allow arbitrary browsing, a client-side solution is 

needed, one which renders arbitrary WWW documents into 
audio on the fly. 

The Web-Based Interactive Radio Environment: WIRE 
system developed by Siemens Corporate Research Inc., 

15 described in United States Patent Application Number 

08/768,046, filed on December 13, 1996 and assigned to 
the same assignee as the present invention, offers a 
mechanism for rendering arbitrary WWW documents using 
audio. As the WIRE system offers broad rendering support 

20 for structured documents as well as audio data, it is a 
natural tool for building client-side, audio-only, WWW 
browsers . 

Summary of the Invention 

25 

The present invention is a system which employs the 
WIRE system and other components to allow browsing the 
WWW with a traditional telephone. The system also 
enables access to e-mail. This system works on the 

30 client-side, and thus requires no special preparation by 
the WWW content-provider. It uses only audio to render 
WWW documents and thus requires no visual display. 

The traditional telephone is utilized to contact a 
host computer which has a voice-capable modem, a 

35 telephone-driven audio WWW browser (TAWB) and a 
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connection to the Internet, called a network interface. 
The TAWB comprises a telephony interface, a digital voice 
processing module (DVP) , an interchange between the 
telephony interface and the DVP, an audio document 

5 renderer, a command and control module, and an Internet 
interface. Additionally, the system contains a friendly 
server for storing information. Finally, the system 
relies on the presence of the WWW which can be described 
as a collection of WWW servers connected to the Internet, 

10. an Intranet or an Extranet. 

Brief Description of the Drawings 

Figure 1 illustrates a block diagram of the present 
15 invention. 

Figure 2 illustrates an overview of a telephone- 
driven audio WWW browser of the present invention. 



20 



25 



Figure 3 illustrates a block diagram of the command 
and control module of the present invention. 

Figure 4 illustrates an example of a touch-tone to 
user command map of the present invention. 

Figure 5 illustrates the high level operation of the 
command and control logic of the present invention. 

Figure 6 illustrates a block diagram of the main 
30 loop of the command and control logic of the present 
invention. 

Figure 7 illustrates a block diagram of the digital 
voice processing module of the present invention. 

35 
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Figure 8 illustrates a block: diagram of th.e friendly 
server of the present invention. 

Figure 9 illustrates an example of a CGI program. 

Detailed Description of the Invention 

Figure 1 shows the high-level architecture of the 
present invention. The user of the system must have a 
telephone handset 10 that has touch-tone capability. 
There are no other requirements placed on the telephone 
handset 10. Notably, the telephone handset 10 may be a 
mobile telephone, such as a cellular telephone. To 
utilize the system, the user dials into a host computer 
system 12. This host computer system 12 has a voice- 
capable modem 14, a telephone-driven audio WWW browser 
(TAWB) 15, and a connection to the Internet, called a 
network interface 18. (Throughout this specification, it 
should be understood that the term "Internet" could be 
readily replaced by "Intranet" or "Extranet" with minimal 
additional changes to the rest of the specification) . 
Typical network interfaces are Ethernet cards or modems. 
Additionally, the system contains a friendly server 20 
for storing information. Finally, the system relies on 
the presence of the WWW, which can be described as a 
collection of WWW servers 22 connected to the Internet. 

At a high level, the system works in the following 
manner. The user employs his telephone handset to dial 
into the host computer. The host computer's voice modem 
accepts the call and acts as an interface between the 
TAWB and the public telephone network. The user then 
issues commands consisting of telephone touch-tones, 
spoken voice commands, or both. The TAWB interprets 
these commands, downloads the appropriate WWW documents 
from the Internet and renders them to an audio stream. 



wo 99/46920 5 PCT/US99/01751 

The TAWB sends the audio stream, via the public telephone 
network, to the telephone handset where the user listens 
to it. As the user listens, he or she may issue 
additional commands which the TAWB will capture and 
5 interpret. For example, the system allows hyperlinks to 
be followed, skips to be made forward and backward 
through the current document, and pauses in the 
rendering. 

The following will describe the Telephone-driven 
10 Audio World Wide Web Browser (TAWB) 16. Figure 2 shows a 
block diagram of the TAWB 16. In consists of a telephony 
interface 23, a digital voice processing module (DVP) 25, 
an interchange between the telephony interface and the 
DVP 24, an audio document renderer 2 6, a command and 
15 control module 27, and an Internet interface 28. The 
Internet interface 2 8 is to be distinguished from the 
network interface 18 shown in Figure 1 in the following 
manner. The Internet interface 28 provides application 
level network protocols such as HTTP and FTP while the 
20 network interface (18 of Figure 1) provides lower level 
network protocols such as TCP/IP. 

An overview of the TAWB 16 follows. The command and 
control module (CCM) 27 directs the action of the other 
modules, interpreting user commands and directing the 
25 appropriate response. The Internet interface 28 provides 
access to WWW servers, from where WWW documents are 
obtained and access to the friendly server. The audio 
document renderer 26 converts structured documents from 
the WWW into an audio rendition. This rendition includes 
30 audio signals sent directly to the telephony interface 
23, but consists primarily of a specially prepared 
structured text stream that is sent to the digital voice 
processing module (DVP) 25. The DVP module 25 converts 
the text stream to an audio "voice" stream and sends this 
35 to the telephony interface 23 by way of the DVP/telephony 
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interchange 24. The DVP 25 also converts voice commands 
received from the telephony interface 23 by way of the 
DVP/ telephony interchange 24 into commands that are 
passed to the CCM 27. The DVP/telephony interchange 24 
is necessary to convert between the dissim^ilar formats 
used by the DVP module 25 and the telephony interface 23. 
The telephony interface 23 captures and delivers audio 
stream.s and touch-tones to and from the public telephone 
network. 

The following will describe the TAWB 16 of the 
present invention in more detail. The Internet interface 
28 is responsible for providing application level network 
services to the CCM 27. Specifically, the Internet 
interface 28 must provide the following well known 
services. Hypertext Transfer Protocol (HTTP): This 
serves to download WWW documents from remote WWW servers 
and also to upload information back to the servers under 
some circumstances. Post Office Protocol (POP): This 
serves to download e-mail documents from remote WWW 
servers. File Transfer Protocol (FTP): This serves to 
transfer files to and from the friendly server. The 
Internet interface 28 takes its direction from the CCM 27 
as described below. It returns all documents which it 
downloads to the CCM 27. 

The CCM 27 directs the operation of the TAWB system 
16. A block diagram of the CCM 27 is shown in Figure 3, 
Within the CCM 27, the touch- tone to user-command map 
(TTUCM) 30 accepts touch-tone digits from the telephony 
interface and determines which user command they 
represent. As commands may consist of varying numbers of 
tones, the TTUCM 30 constitutes a finite-state machine in 
which receiving a touch-tone acts to move the machine 
along an edge. Many TTUCMs are possible, depending on 
the set of commands which the TAWB is to support. 
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An example of a TTUCM 30 is shovm in Figure 4. To 
reduce complexity, not all possible commands are shown. 
A complete TTUCM would be more complex but would follow 
the same principles. In Figure 4, if the first touch- 
tone is a 1, 2 or 3, the TTUCM decides that the command 
is "Follow", "Skip Back", or "Skip Ahead", respectively. 
If the first touch tone is 0, #, or *, the TTUCM accepts 
the next touch-tone and then uses this second tone as an 
index for either the "Get Favorite", "Set Favorite" or 
"Select from History List" commands, respectively. 

The local flags cache 32 is a data-store that keeps 
a local copy of the addresses of documents that the user 
has flagged. The purpose of the flagging process is 
described below. The local favorites cache 34 is a data- 
store that keeps a local copy of the addresses of the 
user's favorite WWW documents. The history list 36 is a 
data-store which stores addresses of pages recently 
visited by the user, in the manner directed by the 
command and control logic 38. It contains a position 
pointer which marks one address as the current address'. 

The command and control logic (CCD 38 directs the 
operations of the CCM. Figure 5 shows the high-level 
operation of the CCL 38. Upon activation, the CCL 38 
through the wait state module 50 is in a wait state. It 
waits for notification from the telephony interface that 
a call has been received and a connection established 
with a user. Next, the CCL directs the Internet 
interface to retrieve the user's favorites from the 
friendly server through the Internet interface retrieval 
director 52. These favorites are then stored in the 
local favorites cache. Next, the CCL through the command 
and control enter module 54 enters the command and 
control main loop, which is described in detail below. 
Upon receiving a quit command from the user via the 
TTUCM, the CCL directs the telephony interface to 
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terminate the telephone connection. This is performed in 
the telephony interface termination director 56. Next, 
the CCL directs the Internet interface to take the user 
favorites in the local favorites cache and store them on 
the friendly server. This is performed in the Internet 
interface store favorites director 58. Finally, the CCL, 
through the Internet interface store user flags director 
60, directs the Internet interface to store the user 
flags from the local flags cache to the friendly server. 

The following will describe the Command and Control 
Main Loop. The main loop of the command and control 
logic is the normal operating logic for the TAWB system 
while it is in use. Figure 6 shows a block diagram of 
the main loop. At a high level, the loop consists of two 
actions repeated continuously: first get a command, then 
execute that command. Commands come either from the 
touch- tone to user-command map (TTUCM) 62 or from the 
voice to user-command map (VUCM) 64 of . the DVP module (25 
of Figure 2) . Note that while either a TTUCM or VUCM is 
required, and both are desirable, only one is necessary 
for the system to. function. A working TAWB system can be 
made with TTUCM only, VUCM only, or both. 

The main loop acts asynchronously from the rest of 
the modules. This means that once the "wait for user 
command" 65 state is reached, new commands may be 
accepted and processed even if other modules have not 
finished their last assigned task. Notably, this means 
that the user may interrupt the rendering of a document 
with new commands, thus allowing a high degree of 
interactivity. 

There are a large number of potential user-level 
commands which may be offered by the TAWB system. These 
commands are those of a traditional WWW browser and are 
well known. There are additionally some commands which 
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are new and relevant in telephone-based browsers. The 
following will describe examples of the most critical set 
of commands. The scope of the present invention is not 
limited to the specific set of commands described but 
5 includes any reasonable set of commands and the new 
commands described, 

After receiving a command, the CCL determines which 
command it is 66. The following describes the handling 
of a set of relevant commands. In the discussion that 

10 follows, the term ^^URL" refers to the Uniform Resource 

Locator, which is an addressing standard used on the WWW. 
Skip Ahead, Skip Back or Restart. In the case of one of 
these commands, the CCM directs 67 the audio document 
renderer to adjust its rendering accordingly. 

15 Reset Favorite N, The CCM stores 68 the current URL in 
the local favorite cache at index N. This process 
overwrites the previous contents of index N in the local 
favorites cache. 

Follow Link. The CCM gets 69 the URL of the current 
20 activ-e link from the audio document renderer, directs 7 0 
the Internet interface to retrieve the document stored at 
that URL, directs 71 the audio document renderer to 
render this document, and updates 72 the history list by 
adding the URL above the current position pointer and 
25 advancing the position pointer. 

Load Favorite N. The CCM lookups 73 the URL stored in 
the local favorites cache at index N, directs 7 0 the 
Internet interface to retrieve the document stored at 
that URL, directs 71 the audio document renderer to 
30 render this document, and updates 72 the history list by 
adding the URL above the current position pointer and 
advancing the position pointer. 

Back Or Forward. The CCM gets 74 the appropriate 
hyperlink from the history list, directs 70 the Internet 
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interface to retrieve the dociiment stored at that URL, 
directs 71 the audio document renderer to render this 
document, and updates 72 the history list by advancing or 
retreating the position pointer as appropriate. 
5 Flag current Page. The CCM gets 7 5 the URL of the 

current document from the history list and stores it in 
the local flags cache . 

Quit. The main loop of the CCL terminates. The CCL then 
continues as specified in Figure 5. 
10 An essential element of the TAWB is the audio 

document renderer (M^R) . The ADR is responsible for 
taking a WWW dociment and creating an audio rendition. 
Some types of documents, are primarily audio in nature 
and these may be rendered directly. For example, 
15 RealAudio streams, waveform audio data, and structured 

audio data like MIDI, can all be rendered directly. For 
documents of these types, the ADR can send the rendered 
output directly to the telephony interface. 

Most WWW documents, however, . are not inherently 
20 audible in nature. Two significant examples are e-mail 
messages and HTML documents which are composed of 
structured text. To convert docioments such as these into 
audio requires a tool such as the WIRE system (described 
in US Patent Application Serial Number 08/768,046). WIRE 
25 accepts a structured document and outputs a text stream 
suitable for rendering by the text-to-speech synthesizer 
(TTS) component of the DVP. WIRE converts abstract, 
semantic mark-up into the literal, syntactic mark-up 
needed to convey the abstractions using audio and in a 
30 form which can be interpreted by a TTS . 

The text stream produced by the ADR is sent to the 
TTS component of the DVP. This text stream is not 
necessarily linear, or ordered in the same way as the 
original WWW document, but instead the ADR may be 
35 directed by the CCM to send arbitrary parts of the 
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document to the TTS • The CCM determines which part of 
the document to send based on the ^^navigation" commands 
sent by the user, such as ^^Skip Ahead" or ^^Skip Back". 
Systems such as WIRE may offer other ""rendering modes" in 
5 which certain parts of a document are summarized or 
skipped based on the user's preferences. 

An additional requirement is that the ADR must 
provide the CCM with the ""active link" when requested. 
This is the hyperlink corresponding to the anchor most 

10 recently rendered. 

The digital voice processing module (DVP) (25 of 
Figure 2) consists of two components as shown in Figure 
7. The text-to-speech synthesizer (TTS)78 accepts marked 
up text from the ADR and generates waveform audio which 

15 is sent to the DVP/telephony interchange. The voice 

recognizer component 7 9 accepts waveform audio from the 
DVP/telephony interchange and sends user command strings 
to the CCM. In a prototype system, Microsoft Speech API 
(SAPI) was used as the interface through which the ADR 

20 and CCM modules could direct the operation of the DVP. 
Using the SAPI interface, there are many commercial 
packages available which can serve the role of the text- 
to-speech synthesizer, such as Lernout & Houspie' s 
TruVoice, and of the voice recognizer, such as AT&T's 

25 Watson. 

The following will describe the DVP/Telephony 
Interchange (24 of Figure 2) . Many commercial DVP 
packages produce or consume audio data in a different 
format than can be accepted by computer telephony 

30 interfaces. The job of the DVP/telephony interchange is 
to convert audio data from one format to another so that 
these two TAWB components can share the data- Since the 
audio is in the form of real-time streams, the format 
conversion must be fast and must be done on the fly. For 

35 example, if the sampling rate produced by the TTS engine 
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is (liferent than that accepted by the telephony 
interface, the interchange must make the conversion . A 
suitable system is described in US Patent Application 
Serial Number ##########, entitled "A Real-Time Down- 
Sampling System For Digital Audio Waveform Data", 
assigned to the same assignee as the present invention, 
filed concurrently with this application and incorporated 
herein by reference- 

The following will describe the telephony interface 
(23 of Figure 20) . A TAWB PC needs an interface to the 
public telephone system. This interface essentially must 
allow the computer to control the modem which acts as the 
local telephony client. Specifically, the interface must 
be able to send and receive waveform audio and touch- 
tones and deliver them to both the TAWB system and the 
public telephony network in an understandable form. In 
practice, many commercial modems provide software drivers 
which conform to Microsoft's Telephony API (TAPI) . Since 
the TAPI definition includes all of the services that the 
TAWB requires, these software drivers may serve the role 
of the telephony interface. 

Although an audio-only WWW browser can be extremely 
useful under many circumstances, it is clear that many 
users will continue to use their visually based browsers 
in circumstances which permit it. Therefore, it is 
convenient to support the sharing of information between 
the two types of browsers. The friendly server (20 of 
Figure 1) is a repository for information that is 
accessible by any WWW-enabled device, including TAWB 
browsers and visual browsers. As some kinds of 
information are difficult to comprehend in an audio-only 
environment, the ability to share addresses with a visual 
browser can be seen as a required feature of an audio- 
only browser. 
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Figure 8 shows a block diagram of the friendly 
server 20. The user favorites store 82 is a persistent 
data repository for addresses of the user's favorite 
documents, that is, the documents he or she has chosen to 

5 associate with preset commands in the TAWB browser • The 
purpose of the user's favorites is to allow the user to 
access certain sites quickly with a minimum of browsing. 
The Internet interface for TAWB browser 84 allows the 
TAWB system to upload replacement favorites which may 

10 have been set by the user while using the TAWB system. 

This interface is simply an FTP server as is known in the 
art. The Internet interface for traditional browser 8 6 
allows the user to modify his or her favorites while 
using a traditional WWW browser. This interface consists 

15 of a CGI program within which the user can modify the 

favorites store and an HTTP server as is known in the art 
which allows the browser to access the CGI program. An 
example of a suitable CGI program is shown in Figure 9. 
This program allows the user to modify the values of the 

20 favorites by typing new URL's in the boxes. Pressing the 
update button completes the modification. In this way, 
the user may easily transfer documents found with a 
traditional browser to his or her TAWB browser. 

The user flags store 88 is a persistent data 

25 repository for the user's flagged URLs. The purpose of 
the flags is to make documents found with the TAWB 
browser available to a traditional browser. The Internet 
interface for TAWB browser 84 allows the TAWB system to 
upload addresses of documents which have been flagged by 

30 the user while using the TAWB system. This interface is 
simply an FTP server as is known in the art. The 
addresses are stored in the form of a WWW document 
containing hyperlinks to the flagged documents. The 
Internet interface for traditional browser 8 6 allows the 

35 user to view this document and follow the hyperlinks with 



BNSDOCID: <WO 9946920A1_!_: 



wo 99/46920 



14 



PCT/US99/01751 ' 



a traditional browser. This interface is simply an HTTP 
server as is knovm in the art . 

In strnimary, the present invention is a system which 
allows a user with an ordinary telephone to browse the 
5 World Wide Web. The present invention is an improvement 
over prior art systems because any WWW docioments can be 
obtained, not just documents that were specially prepared 
for audio access. The present invention works with a 
traditional telephone and does not require the phone to 

10 have a visual display or special internal electronics. 

The present invention is particularly valuable to people 
who need mobile access, are visually impaired, cannot 
afford browsers which require special hardware, or who 
work in environments where visual displays are not 

15 practical. 

It is not intended that this invention be limited to 
the hardware or software arrangement, or operational 
procedures shown disclosed. This invention includes all 
20 of the alterations and variations thereto as encompassed 
within the scope of the claims as follows. 
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CLAIMS: 

1. A system for browsing the world wide web with a 
traditional telephone comprising: 

a host computer system capable of connecting to a 
telephone network and Internet, wherein said host 
computer system comprises: 
a voice modem; 

a telephone-driven audio WWW browser (TAWB) 
connected to said voice modem; and 

a network interface connected to said 
telephone-driven audio WWW browser. 

2. A system for browsing the world wide web with a 
traditional telephone as claimed in claim 1 wherein said 
telephone-driven audio WWW browser comprises: 

a telephony interface; 

a DVP/telephony interchange connected to said 
telephony interface ; 

a digital voice processing unit connected to said 
DVP/ telephony interchange; 

an audio document renderer connected to said digital 

voice processing unit; 

a command and control module connected to said audio 

document renderer; and 

an internet interface connected to said command and 

control module. 

3. A system for browsing the world wide web with a 
traditional telephone as claimed in claim 2 wherein said 
digital voice processing unit comprises: 

a text-to-speech synthesizer; 

4. A system for browsing the world wide web with a 
traditional telephone as claimed in claim 3 wherein said 
digital voice processing unit further comprises: 
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a voice recognizer component; 

wherein said digital voice processing unit converts 
voice commands received from said telephony interface by 
way of said DVP/telephony interchange into commands that 
are passed to said command and control module. 

5. A system for browsing the world wide web with a 
traditional telephone as claimed in claim 2 wherein said 
command and control module comprises: 

a touch- tone to user-command map; 

command and control logic; 

a local flags cache; 

a local favorites cache; and 

a history list module. 

6. A system for browsing the world wide web with a 
traditional telephone as claimed in claim 5 wherein said 
command and control logic comprises: 

a wait state module; 

an internet interface retrieval director; 

a command and control enter module; 

a telephony interface termination director; 

an internet interface store favorites director; and 

an internet interface store user flags director. 

7. A system for browsing the world wide web with a 
traditional telephone as claimed in claim 6 wherein said 
command and control enter module comprises: 

a voice to user-command map; 
a touch- tone to user-command map; 
a user-command wait module; and 
command decision module. 
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8. A system for browsing the world wide web with a 
traditional telephone as claimed in claim 1 further 
comprising: 

a friendly server for storing information. 

9. A system for browsing the world wide web with a 
traditional telephone as claimed in claim 8 wherein said 
friendly server comprises: 

an internet interface for said telephone-driven 
audio WWW browser; 

an internet interface for a traditional browser; 
a user favorites store; and 
a user flags store. 

10. A system for browsing the world wide web with a 
traditional telephone as claimed in claim 1 wherein said 
telephone-driven audio WWW browser comprises: 

telephony interface means for capturing and 
delivering audio streams and touch tones to and from said 
telephone network; 

DVP/telephony interchange means for converting 
between dissimilar formats used by said telephony 
interface means; 

digital voice processing means for converting a text 
stream to an audio voice stream and for sending said 
audio voice stream to said telephony interface means by 
way of said DVP/telephony interchange means/ 

audio dociament renderer means for converting 
structured documents from said world wide web into an 
audio rendition; 

command and control module means for interpreting 
user coinmands and for directing an appropriate response; 
and 

internet interface means for providing access to WWW 
servers . 
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11. A system for browsing the world wide web with a 
traditional telephone as claimed in claim 10 wherein said 
audio rendition comprises: 

5 audio signals sent directly to said telephony 

interface means. 

12 . A system for browsing the world wide web with a 
traditional telephone as claimed in claim 10 wherein said 

10 audio rendition comprises: 

a specially prepared structured stream that is sent 
to said digital voice processing means. 

13. A method for browsing the world wide web with a 
15 traditional telephone comprising the steps of: 

utilizing a host computer system connected to a 
telephone network and Internet; wherein utilizing a host 
computer system comprises the steps of: 

communicating through a voice modem from said 

20 telephone network; 

utilizing a telephone-driven audio WWW browser 

(TAWB) connected to said voice modem; and 

using a network interface between said Internet 
and said telephone-driven audio WWW browser. 

25 

14. A method for browsing the world wide web with a 
traditional telephone as claimed in claim 13 wherein 

utilizing a telephone-driven audio WWW browser comprises 

30 the steps of: 

using a telephony interface; 

using a digital voice processing unit; 

interfacing said telephony interface and said 
digital voice processing unit with a DVP/telephony 
35 interchange; 



BNSDOCID; <VIO 9946920A1_1_> 



wo 99/46920 PCT/US99/01751 

Utilizing an audio document Tenderer connected to 
said digital voice processing unit; 

utilizing a command and control module connected to 
said audio document renderer; and 

using an internet interface connected to said 
command and control module * 

15. A method for browsing the world wide web with a 
traditional telephone as claimed in claim 14 wherein 
using a digital voice processing unit comprises the step 
of: 

performing text-to-speech synthesizing . 

16. A method for browsing the world wide web with a 
traditional telephone as claimed in claim 15 wherein 
using a digital voice processing unit further comprises 
the step of: 

recognizing a voice component. 

17. A method for browsing the world wide web with a 
traditional telephone as claimed in claim 14 wherein 
utilizing a command and control module comprises the 
steps of: 

converting a touch-tone to us er- commands ; 
utilizing command and control logic; 
utilizing a local flags cache; 
utilizing a local favorites cache; and 
accessing a history list module. 

18. A method for browsing the world wide web with a 
traditional telephone as , claimed in claim 17 wherein 
utilizing command and control logic comprises the steps 
of: 

entering a wait state; 
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directing said internet interface to retrieve a 

user's favorites from a friendly servers- 
entering a command and control main loop; 
directing said telephony interface to terminate 

connection; 

directing said internet interface to store 

favorites; and 

directing said internet interface to store user 

flags . 

19. A method for browsing the world wide web with a 
traditional telephone as claimed in claim 18 wherein 
entering a command and control main loop comprises the 

steps of: 

converting voice to user-commands; 
converting touch-tone to user-commands; 
waiting for a user-command; and 
deciding which user command. 

20. A method for browsing the world wide web with a 
traditional telephone as claimed in claim 13 further 
comprising the step of: 

storing information in a friendly server. 

21. A method for browsing the world wide web with a 
traditional telephone as claimed in claim 20 wherein 
storing information comprises the steps of: 

interfacing said friendly server with said 
telephone-driven audio WWW browser; 

interfacing said friendly server with a traditional 
browser; 

storing user favorites; and 

storing user flags. 
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22. A method for browsing the world wide web with a 
traditional telephone as claimed in claim 13 wherein 
utilizing said telephone-driven audio WWW browser 
comprises the steps of: 

capturing and delivering audio streams and touch 
tones to and from said telephone network with a telephony 
interface; 

converting between dissimilar formats used by said 
telephony interface means with a DVP/telephony 
interchange; 

converting a text stream to an audio voice stream 
and sending said audio voice stream to said telephony 
interface by way of said DVP/telephony interchange with a 
digital voice processing unit; 

converting structured documents from said world wide 
web into an audio rendition with an audio document 
renderer ; 

interpreting user commands and directing an 
appropriate response with a command and control module; 
and 

providing access to WWW servers with an internet 
interface. 

23. A system for browsing the world wide web with a 
traditional telephone comprising: 

a computer system capable of connecting to a 
telephone network and Internet; and 

a friendly server for exchanging information between 
a telephone based browser and a visual based browser . 

24. A system for browsing the world wide web with a 
traditional telephone as claimed in claim 23 wherein said 
friendly server comprises: 

an internet interface for a telephone-driven audio 

WWW browser; 
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an internet interface for a traditional browser; 
a user favorites store; and 
a user flags store. 

5 25. A system for exchanging information between a 
telephone based browser and a visual based browser 
comprising: 

a computer system capable of connecting to a 
telephone network and Internet; and 
10 a friendly server connected to said computer system. 

26. A system for exchanging information between a 
telephone based browser and a visual based browser as 
claimed in claim 25 wherein said friendly server 
15 comprises: 

an internet interface for a telephone-driven audio 

WWW browser; 

an internet interface for a traditional browser; 
a user favorites store; and 
20 a user flags store. 
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